0050/48141 ^ 09/462629 

m^'' 514 RUbCT/PTO 11 JAN 2000 



DNA sequence encoding a hydroxyphenylpyruvate dioxygenase, and 
its overproduction in plants 



The present invention relates to a method of generating plants 
with an elevated vitamin E content by expressing an exogenous or 
endogenous HPPD gene in plants or plant organs. The invention 
furthermore relates to the use of the corresponding nucleic acids 
encoding an HPPD gene in transgenic plants to make the latter 
10 resistant to HPPD inhibitors, and to the use of the DNA sequence 
encoding an HPPD for generating a test system for identifying 
HPPD inhibitors. 

An important aim in plant molecular genetics is the generation of 
15 plants with an elevated content of sugars, enzymes and amino 
acids. It would also be economically interesting to develop 
plants with an elevated vitamin content, eg. an elevated vitamin 
E content. 

20 The eight naturally occurring compounds with vitamin E activity 
are derivatives of 6-chromanol (Ullmann's Encyclopedia of 
Industrial Chemistry, Vol. A 27 (1996), VCH Verlagsgesellschaf t , 
Chapter 4., 47 8-488, Vitamin E). The first group (la - d) is 
derived from tocol, while the second group is composed of 

25 tocotrienol derivatives (2a - d) : 




35 la, a-tocopherol: Ri = r2 = r3 = CH3 

lb, p-tocopherol [148-03-8]: r1 = r3 = CH3, R2 = H 

Ic, Y-tocopherol [54-2 8-4]: Ri = H, r2 = r3 = CH3 

Id, 8-tocopherol [119-13-1]: R^ = r2 = H, r3 = CH3 
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2a, a-tocotrienol [1721-51-3]: Ri = r2 = r3 = i 

2b, p-tocotrienol [490-23-3]: Ri = r3 = CH3, r2 

2c, Y-tocotrienol [14101-61-2]: R^ = H, r2 = r3 

2d, 6-tocotrienol [25612-59-3]: Ri = R2 = H, R^ 



CH3 



CH3 
CH3 



H 
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a-Tocoplierol is of great economic importance. 

The development of crop plants with an elevated vitamin E content 
by means of tissue culture or seed mutagenesis and natural 
selection has its limits. On the one hand, the vitamin E content 
must be detectable as early as at the tissue culture level and, 
on the other hand, only those plants can be manipulated via 
tissue culture techniques which can successfully be regenerated 
into entire plants, starting from cell cultures. Moreover, 
following mutagenesis and selection, crop plants may show 
undesirable characteristics which have to be eliminated by 
bacJc-crossing, in some cases repeated bacJc-crossing. Also, 
elevation of the vitamin E content by means of crossing would be 
limited to plants of the same species. 



Those are the reasons why the genetic engineering approach, viz. 
isolating an essential biosynthesis gene which encodes the 
vitamin E synthesis performance and transferring it specifically 
into crop plants, is superior to the traditional breeding method. 
The conditions for this method are that the biosynthesis and its 
regulation are Icnown and that genes which affect biosynthesis 
performance are identified. 

Tocopherol biosynthesis in plants and algae proceeds in a Icnown 
manner and is as follows: 
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The precursor of the aromatic ring of the tocopherols is 
p-hydroxyphenylpyruvate (3), which is converted enzymatically into 
homogentisic acid (4) with the aid of the enzyme 
hydroxyphenylpyruvate dioxygenase (HPPD), and the homogentisic 
5 acid reacts with phytyl pyrophosphate with elimination of CO2 to 
give the precursor (6). The tocotrienol biosynthesis route starts 
with a condensation reaction between homogentisic acid (4) and 
geranylgeranyl pyrophosphate to give the precursor ( 5 ) . Enzymatic 
cyclization of the precursors 5 or 6 gives 6 - tocotrienol or 5 - 
10 tocopherol, respectively. Some of these biosynthesis enzymes have 
been isolated. 



■i^hile searching for Arabidopsis mutants with defects in the 
carotinoid biosynthesis, a white phenotype mutant was identified 
15 which is not capable of producing active HPPD. If this mutant, 
termed pds2 , is raised in the presence of homogentisic acid, it 
produces carotinoids, like the wild type, and greens (Norris et 
al.. Plant Cell (1995) 7: 2139 - 2149). This work shows that HPPD 
activity is a prerequisite for the formation of 
20 photosynthetically active chloroplasts . Without this enzyme, no 
plastoquinones are formed, which are required as acceptors for 
liberated reduction equivalents during carotinoid biosynthesis 
(phytoene desaturation) . The fact that HPPD has a key role in the 
plastid metabolism makes it an interesting target for herbicides. 
25 Sulcotriones efficiently inhibit the activity of the enzyme 
(Schultz et al., FEES Lett. (1993) 318: 162 - 166). 



Sequences of HPPD-specif ic genes are already known from the 
organisms mentioned below: 
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Organism 


Sequence 
name 


Access number 
database 


Humans 


HPPD_HUMAN 


X72389 


Pig 


HPPD_PIG 


D13390 


Rat 


HPPD_RAT 


M18405 


Mouse 


HPPD_MOUSE 


D29987 


S t r ept omy ce s 
avermitilis 


SA11864 


U11864 


Pseudomonas 
sp. strain 
P.J. 874 


HPPD_PSESP 


P80064 


Arabidopsis 


HPPD_ARAB1 


AF900228 




HPPD_ARAB2 


U89267 
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Furthermore, the following sequences, which show a marked 
homology with HPPD sequences, can be found in the databases: 

PEA3_M0USE: Mus muscula (mouse) PEA3 polypeptide, AC X63190? 

5 

MELA_SHECO: Shewanella colwelliana, melA protein, AC M59289. 

WO 96/38567 describes the HPPD DNA sequence from Arahidopsis 
thaliana and Daucus carota. 

10 

A knowledge of the HPPD DNA sequences is an absolute prerequisite 
both for the use in crop protection for the generation of 
herbicide-resistant plants and for increasing the vitamin E 
synthesis in plants, for example for producing animal feeds with 
15 elevated vitamin E content. 



j/yit is an object of the present invention to develop a transgenic 
plant with elevated vitamin E content. 

20 It is a further object of the present invention to develop a 
transgenic plant which is resistant to HPPD inhibitors. 

We have found that these objects are achieved, surprisingly, by 
overexpressing an HPPD gene in the plants. 

25 

It is an additional object of the present invention to develop a 
test system for identifying HPPD inhibitors. 

We have found that this object is achieved by expressing a barley 
30 HPPD gene in a plant or in a microorganism and subsequently 
testing chemicals for inhibition of HPPD enzyme activity. 

A first aspect of the present invention relates to the cloning of 
the complete barley HPPD gene via isolating the 
35 HPPD-gene-specif ic cDNA (HvSD36). 



During leaf senescence, the vitcunin E content in the leaves is 
markedly increased (Rise et al.. Plant Physiol. (1989) 89: 
1028 - 1030). The monocotyledonous leaf of barley represents a 
40 gradient of cells of different ages since the leaf has a basal 
meristem, from which new cells are formed by successive division. 
Thus, the oldest cells are located at the leaf tip and the 
youngest at the base. Fig. 1 shows a diagram of the primary leaf 
of barley on various days after sowing. The total leaf length 
45 measured can be seen from the scale on the left-hand side. Shown, 
and termed I - IV, are the leaf sections of the primary leaf which 
are differentiated to various degrees and which have been 
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selected for gene expression analysis. The plants were raised in 
a daily light/dark photoperiod (L/D) and, for inducing 
senescence, were excised after 6 days and incubated for 2 days in 
the dark (2 nD) . A "Northern blot" analysis of RNA from the barley 
5 primary leaf from sections which had differentiated to various 
degrees (see Fig. 2) suggest that HPPD expression in barley is 
controlled in a development-dependent manner. Thus, copious 
accumulation of the approx. 1600 nt long transcript takes place in 
the meristematic region on the primary leaf base (I). The content 

10 of this transcript decreases with increasing age of the tissue 
(Ila and lib) and increases again in the fully differentiated 
cells with mature chloroplasts (III). Finally, the content of the 
1600 nt long transcript is highest in the senescing sections of 
the primary leaf (IV). In addition, an approx. 3100 nt long 

15 transcript can be detected only in the meristematic cells on the 
base of the primary leaf. Again, this transcript can no longer be 
detected with increasing tissue maturation. 

With the aid of the so-called "Differential Display" method, a 
20 207 bp cDNA fragment was first isolated whose corresponding 

transcript accumulates in the primary leaf of barley in the case 
of dark- induced senescence. This fragment (sequence protocol: 
sequence ID NO:l: nucleotide position 1342 - 1549) was 
subsequently used as a probe to isolate a cDNA clone with a 
25 larger insert in a cDNA library (in X-ZAP-II) from senescing 
barley flag leaves. 

Diagram of the cDNA subclone HvSD 36 from the X.-ZAP-II library: 
T-7 -*XhoI Sail Hindlll EcoRI Ncol Psd EcoRI PstI BamHI Xbal — T3 

I I I tea ^ ' Btri ' " ' 

35 14 bp 759 bp 14 bp 

The CDNA fragment (sequence protocol: Sequence ID NO:l: 
nucleotide position 771 - 1529) was cloned into the EcoRI 
40 cleavage site of pBluescript (SK") . In addition, both ends of the 
cDNA are equipped with a 14 bp adaptor sequence which was required 
for ligation into X-2AP-II. Selected restriction sites of the 
vector and of the cDNA itself are shown - 

45 The 759 bp long cDNA fragment was used as probe in a further 

experiment to obtain a complete sequence of HvSD 36. To this end, 
a CDNA library from RNA of the meristematic section of 5-day-old 
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barley seedlings was available. The lambda phage ExCell Eco RICIP 
from Pharmacia (Freiburg) (product number: 27-5011, 45.5kb) was 
used for this cDNA library. 

5 A 1565 bp long cDNA clone was isolated, see sequence protocol: 
sequence ID N0:1: and 2. 

Amongst the sequences in the databases, the 434 amino acids long 
protein sequence has a homology of 58%, which is the highest 
10 homology with the HPPD sequence from Arabidopsis thaliana. 

To find a genomic clone which contains the complete HPPD gene 
sequence, a lambda FIXII library of barley was obtained from 
Stratagene (Heidelberg, product number 946104). The library was 

15 prepared using DNA from etiolated leaves of winter barley cv. 
Igri. The DNA was subjected to partial digestion with Sau3AI. 
Prior to cloning into the Xhol cleavage site of the vector, the 
fragment ends and the phage arms were filled up with nucleotides. 
Screening of the library with 200,000 pfu in the first round gave 

20 only one clone which hybridized with cDNA HvSD36. After 

subjecting this recombinant phage to restriction digestion with 
PstI and Sad, fragments of a size of 5400, 3800 and 1800 bp, 
respectively, were isolated which can be detected in a "Southern" 
blot hybridization with the HvSD36 probe. These sub-fragments 

25 exist in cloned form in the Bluescript vector. Figure 3 shows the 
construction of the barley HPPD gene in the form of a diagram. 

The invention relates in particular to expression cassettes whose 
sequence encodes an HPPD or a functional equivalent thereof, and 

30 to the use of these expression cassettes for generating a plant 
with an elevated vitamin E content. The nucleic acid sequence may 
be, for example, a DNA or a cDNA sequence. Encoding sequences 
which are suitable for insertion into an expression cassette 
according to the invention are, for example, those which encode 

35 an HPPD and which impart, to the host, the ability to overproduce 
vitamin E. 

In addition, the expression cassettes according to the invention 
comprise regulatory nucleic acid sequences which govern 

40 expression of the encoding sequence in the host cell. In 

accordance with a preferred embodiment, an expression cassette 
according to the invention comprises upstream, ie. on the 5' end 
of the encoding sequence, a promoter and downstrecon, ie. on the 
3' end, a polyadenylation signal and, if appropriate, other 

45 regulatory elements which are operatively linked with the 

encoding sequence for the HPPD gene which is located in-between. 
Operative linkage is to be understood as meaning the sequential 
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arrangement of promoter, encoding sequence, terminator and, if 
appropriate, other regulatory elements in such a way that each of 
the regulatory elements can fulfill its function as intended when 
the encoding sequence is expressed. The sequences preferred for 
5 operative linkage, but not limited thereto, are targeting 
sequences for guaranteeing subcellular localization in the 
apoplast, in the vacuole, in plastids, in the mitochondrium, in 
the endoplasmatic reticulum (ER), in the nucleus, in liposomes or 
in other compartments and translation enhancers such as the 
10 5' leader sequence from the tobacco mosaic virus (Gallie et al., 
Nucl. Acids Res. 15 (1987) 8693 - 8711). 

For example, the plant expression cassette can be incorporated 
into the tobacco transformation vector pBinAR-Hyg. Fig. 4 shows 
15 the tobacco transformation vectors pBinAR-Hyg with 35S 

promoter (A) and pBinAR-Hyg with seed-specific promoter phaseolin 
796 (B): 

« 

- HPT: hygromycin phosphotransferase 
20 - OCS: octopine synthase terminator 

- PNOS: nopaline synthase promoter 

- those restriction sites which cleave the vector only once are 
also shown. 

25 Suitable as promoters of the expression cassette according to the 
invention are, in principle, all promoters which can control the 
expression of foreign genes in plants. In particular a plant 
promoter or a promoter derived from a plant virus is preferably 
used. Particularly preferred is the CaMV 35S promoter from 

30 cauliflower mosaic viruss (Franck et al.. Cell 21 (1980) 
285 - 294). It is known that this promoter contains various 
recognition sequences for transcriptional effectors which in 
their entirety lead to permanent and constitutive expression of 
the gene introduced (Benfey et al., EMBO J. 8 (1989) 2195 - 2202)* 

35 

The expression cassette according to the invention may 
additionally comprise a chemically inducible promoter by means of 
which expression of the exogenous HPPD gene in the plant can be 
controlled at a specific point in time. Such promoters which can 

40 be used are, inter alia, for example the PRPl promoter (Ward et 
al.. Plant. Mol. Biol. 22 (1993), 361-366), a promoter which can 
be induced by salicylic acid (WO 95/19443), a promoter which can 
be induced by benzenesulf onamide (EP-A 388186), a promoter which 
can be induced by tetracyclin (Gatz et al., (1992) Plant J. 2, 

45 397-404), a promoter which can be induced by abscisic acid 
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(EP-A 335528). or a promoter which can be induced by ethanol or 
cyclohexanone (WO 93/21334 ) . 

Furthermore, particularly preferred promoters are those which 
5 ensure expression in tissues or plant organs in which the 
biosynthesis of vitamin E, or its precursors, takes place. 
Promoters which must be mentioned in particular are those which 
guarantee leaf -specif ic expression. Promoters which may be 
mentioned are the potato cytosolic FBPase or the potato ST-LSI 
10 promoter (Stockhaus et al., EMBO J. 8 (1989) 2445 - 245)- 

With the aid of a seed-specific promoter, it was possible stably 
to express a foreign protein in the seeds of transgenic tobacco 
plants in an amount of up to 0.67% of the total soluble seed 

15 protein (Fiedler and Conrad, Bio/Technology 10 (1995), 

1090-1094). The expression cassette according to the invention 
can therefore contain, for example, a seed-specific promoter 
(preferably the phaseolin promoter (US 5504200), the USP 
(Baumlein, H. et al. Mol. Gen. Genet. (1991) 225 (3), 459 - 467) 

20 or LEB4 promoter (Fiedler and Conrad, 1995)), the LEB4 signal 

peptide, the gene to be expressed and an ER retention signal. The 
construction of such a cassette is shown in the form of a diagram 
in Figure 4 by way of excimple. 

25 An expression cassette according to the invention is prepared by 
fusing a suitable promoter with a suitable HPPD DNA sequence and 
preferably a DNA which is inserted between promoter and HPPD DNA 
sequence and which encodes a chloroplast-specif ic transit 
peptide, and a polyadenylation signal, using customary 

30 recombination and cloning techniques as they are described, for 
example, in T. Maniatis, E.F. Fritsch and J. Sambrook, Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY (1989) and in T.J. Silhavy, M.L. Berman and 
L.W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor 

35 Laboratory, Cold Spring Harbor, NY (1984) and in Ausubel, F.M. et 
al.. Current Protocols in Molecular Biology, Greene Publishing 
Assoc. and Wiley-Interscience (1987). 

Particularly preferred sequences are those which guarantee 
40 targeting into the apoplast, into plastids, into the vacuole, the 
mitochondrium, the endoplasmatic reticulum (ER), or, by means of 
the absence of suitable operative sequences, the remaining in the 
compartment of formation, the cytosol (Kermode, Crit. Rev. Plant 
Sci. 15, 4 (1996), 2 85 - 423) • Localization in the ER has proved 
45 to be especially advantageous for the amount of protein 
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accumulation in transgenic plants (Schouten et al. , Plant Mol. 
Biol. 30 (1996), 781 - 792)* 

The invention also relates to expression cassettes whose DNA 
5 sequence encodes an HPPD fusion protein, a moiety of the fusion 
protein being a transit peptide which governs translocation of 
the polypeptide. Especially preferred are chloroplast-specif ic 
transit peptides which are cleaved enzymatically from the HPPD 
moiety after the HPPD gene product has been translocated into the 
10 chloroplasts. Particularly preferred is the transit peptide which 
is derived from plastid transketolase (TK) or a functional 
equivalent of this transit peptide (eg. the transit peptide of 
the small subunit of rubisco or of Ferredoxin NADP 
oxidoreductase) . 

15 

The HPPD-encoding nucleotide sequence inserted can be prepared 
synthetically or obtained naturally or comprise a mixture of 
synthetic and natural DNA components. In general, there are 
prepared synthetic nucleotide sequences with codons which are 

20 preferred by plants. These codons which are preferred by plants 
can be determined from amongst codons with the highest protein 
frequency which are expressed in most interesting plant species. 
When preparing an expression cassette, various DNA fragments may 
be manipulated in order to obtain a nucleotide sequence which 

25 expediently reads in the correct direction and which is provided 
with a correct reading frame. To connect the DNA fragments to 
each other, adaptors or linkers may be joined onto the fragments- 

The promoter and terminator regions according to the invention 
30 may advantageously be provided, in the direction of 

transcription, with a linker or polylinker which comprises one or 
more restriction sites for insertion of this sequence. As a rule, 
the linker has 1 to 10, in most cases 1 to 8, preferably 2 to 6, 
restriction sites. In general, the linker within the regulatory 
35 regions has a size of less than 100 bp, frequently less than 

60 bp, but at least 5 bp. The promoter according to the invention 
may be both native, or homologous, but also foreign, or 
heterologous, to the host plant. The expression cassette 
according to the invention comprises, in the 5 '-3' transcription 
40 direction, the promoter according to the invention, any desired 
DNA sequence and a region for transcriptional termination. 
Various termination regions can be exchanged for each other as 
desired. 



45 It is furthermore possible to employ manipulations which provide 
suitable restriction sites or which remove excess DNA or 
restriction sites. Where insertions, deletions or substitutions. 
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eg. transitions and transversions, are suitable, it is possible 
to use in vitro mutagenesis, primer repair, restriction or 
ligation. In the case of suitable manipulations, eg. restriction, 
chewing back or filling up overlaps for blunt ends, complementary 
5 ends of the fragments may be provided for ligation. 

What may be of importance for the success according to the 
invention is, inter alia, attaching the specific ER retention 
signal SEKDEL (Schouten, A. et al. Plant Mol. Biol. 30 (1996), 
10 781 - 792), which results in a three to four times higher than 
average expression level. Other retention signals which occur 
naturally in plant and animal proteins which are localized in the 
ER may also be used for constructing the cassette - 

15 Preferred polyadenylation signals are plant polyadenylation 

signals, preferably those which correspond essentially to T-DNA 
polyadenylation signals from Agrobacterium tumefaciensr in 
particular of gene 3 of the T-DNA (octopine synthase) of the Ti 
plasmid pTiACHS (Gielen et al., EMBO J. 3 (1984) 835 et seq.), or 

20 functional equivalents. 

An expression cassette according to the invention may comprise, 
for example, a constitutive promoter (preferably the CaMV 35S 
promoter), the LeB4 signal peptide, the gene to be expressed and 
25 the ER retention signal. The preferred ER retention signal used 
is the amino acid sequence KDEL (lysine, aspartic acid, glutamic 
acid, leucine) . 

The fused expression cassette which encodes an HPPD gene is 
30 preferably cloned into a vector, for example pBinl9, which is 

suitable for transforming Agrobacterium tumefaciens. Agrobacteria 
which are transformed with such a vector can then be used in the 
known manner for transforming plants, in particular crop plants, 
eg. tobacco plants, for example by immersing scarified leaves or 
35 leaf sections in an agrobacteria solution and subsequently 

growing them in suitable media. The transformation of plants by 
means of agrobacteria is known, inter alia, from F.F. White, 
Vectors for Gene Transfer in Higher Plants; in Transgenic Plants, 
Vol. 1, Engineering and Utilization, Eds. S.D. Kung and R. Wu, 
40 Academic Press, 1993, pp. 15-38. The transformed cells of the 
scarified leaves or leaf sections can be used for regenerating, 
in the known manner, transgenic plants which contain a gene for 
expression of an HPPD gene integrated into the expression 
cassette according to the invention. 

45 
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To transform a host plant with an HPPD-encoding DNA, an 
expression cassette according to the invention is incorporated 
into a recombinant vector in the form of an insertion, and the 
vector DNA of this recombinant vector additionally comprises 
5 functional regulation signals, for example sequences for 

replication or integration. Suitable vectors are described, inter 
alia, in "Methods in Plant Molecular Biology and Biotechnology" 
(CRC Press), Chapter. 6/7, pp. 71 - 119 (1993). 

10 Using the above-cited recombination and cloning techniques, the 
expression cassettes according to the invention can be cloned 
into suitable vectors which allow their multiplication, for 
example in E. coii. Suitable cloning vectors are, inter alia, 
pBR332, pUC series, M13mp series and pACYC184. Especially 

15 suitable are binary vectors which are capable of replicating not 
only in E. coli, but also in agrobacteria. 

The invention furthermore relates to the use of an expression 
cassette according to the invention for transforming plants, 
20 plant cells, plant tissues or plant organs. The preferred purpose 
of the use is to raise the vitamin E content of the plant. 

Depending on the choice of the promoter, expression may take 
place specifically in the leaves, in the seeds or in other plant 
25 organs. The present invention also relates to such transgenic 
plants, their propagation material and their plant cells, plant 
tissues or plant organs. 

In addition, the expression cassette according to the invention 
30 may also be employed for transforming bacteria, cyanobacteria, 
yeasts, filamentous fungi and algae with the purpose of raising 
the vitamin E production. 

The transfer of foreign genes into the genome of a plant is 
35 termed transformation. This process exploits the previously 
described methods of transforming and regenerating plants from 
plant tissues or plant cells to obtain transient or stable 
transformation. Suitable methods are protoplast transformation by 
polyethylene-glycol induced DNA uptake, the ballistic method with 
40 the gene gun - the so-called particle bombardment method, 

electroporation, incubation of dry embryos in DNA-containing 
solution, microinjection and Agrobacteri uni-mediated gene 
transfer. The abovementioned methods are described, for example, 
in B. Jenes et al.. Techniques for Gene Transfer, in: Transgenic 
45 Plants, Vol. 1, Engineering and Utilization, Eds. S.D. Kung and R. 
Wu, Academic Press (1993) 128 - 143, and in Potrykus Annu. Rev, 
Plant Physiol. Plant Molec. Biol. 42 (1991) 205 - 225)- The 
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construct to be expressed is preferably cloned into a vector 
which is suitable for transforming AgroiJacteriujn tumefaciens , for 
example pBinl9 (Bevan et al., Nucl. Acids Res. 12 (1984) 8711). 

5 Agrobacteria transformed with an expression cassette according to 
the invention can also be used, in a known manner, for 
transforming of plants, in particular crop plants such as 
cereals, maize, oats, soya, rice, cotton, sugar beet, canola, 
sunflowers, flax, hemp, potatoes, tobacco, tomatoes, oilseed 
10 rape, alfalfa, lettuce and the various tree, nut and grapevine 
species, for example by immersing scarified leaves or leaf 
sections in an agrobacteria solution and subsequently growing 
them in suitable media. 

15 Functionally equivalent sequences which encode an HPPD gene are, 
in accordance with the invention, those sequences which still 
have the desired functions despite a different nucleotide 
sequence. Thus, functional equivalents embrace naturally 
occurring variants of the sequences described herein and also 

20 artificial nucleotide sequences, eg. artificial nucleotide 
sequences which have been obtained by chemical synthesis and 
which are adapted to the codon usage of a plant. 

A functional equivalent is also to be understood as meaning, in 
25 particular, natural or artificial mutations of an originally 
isolated HPPD-encoding sequence which continues to show the 
desired function. Mutations encompass substitutions, additions, 
deletions, inversions or insertions of one or more nucleotide 
residues. Thus, the present invention also encompasses those 
30 nucleotide sequences which are obtained by modifying the present 
nucleotide sequence. The purpose of such a modification may be, 
for example, a further limitation of the encoding sequence 
contained therein, or else, for example, the insertion of further 
cleavage sites for restriction enzymes. 

35 

Functional equivalents are also those variants whose function is 
less or more pronounced in comparison with the starting gene or 
gene fragment . 

40 Also suitable are artificial DNA sequences as long as they, as 
described above, mediate the desired characteristic of raising 
the vitamin E content in the plant by overexpressing the HPPD 
gene in crop plants. Such artificial DNA sequences can be 
determined for example by back-translation of proteins 

45 constructed with the aid of molecular modeling and which have 
HPPD activity, or by in vitro selection. Especially suitable are 
encoding DNA sequences which were obtained by back-translating a 
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polypeptide sequence in accordance with the codon usage specific 
to the host plant. The specific codon usage can be determined 
readily by an expert familiar with plant genetic methods using 
computer evaluations of other, known genes of the plant to be 
5 transformed. 

Further suitable equivalent nucleic acid sequences according to 
the invention which must be mentioned are sequences which encode 
fusion proteins, a component of the fusion protein being a plant 

10 HPPD polypeptide or a functionally equivalent moiety thereof. The 
second moiety of the fusion protein can be, for example, a 
further polypeptide with enzymatic activity or an antigenic 
polypeptide sequence with the aid of which the detection of HPPD 
expression is possible (eg. myc-tag or his-tag). However, this is 

15 preferably a regulatory protein sequence, eg. a signal or transit 
peptide, which leads the HPPD protein to the desired site of 
action. 

However, the invention also relates to the expression products 
20 and fusion products, of a transit peptide and a polypeptide with 
HPPD activity, which have been produced in accordance with the 
invention. 

Raising the vitamin E content means, for the purposes of the 
25 present invention, the artificially acquired ability of an 
elevated vitamin E biosynthesis performance by means of 
functional overexpression of the HPPD gene in the plant in 
contrast to the non-genetically-engineered plant for the duration 
of at least one plant generation. 

30 

The vitamin E biosynthesis site is generally the leaf tissue, so 
that leaf-specific expression of the HPPD gene is expedient. 
However, it will be understood readily that vitamin E 
biosynthesis is not necessarily restricted to the leaf tissue, 
35 but may also take place tissue-specifically in all other organs 
of the plant, for example in fatty seeds. 

in addition, constitutive expression of the exogenous HPPD gene 
is advantageous. On the other hand, inducible expression may also 
40 appear desirable. 

The efficacy of expression of the transgenically expressed HPPD 
gene can be determined for example in vitro by shoot meristem 
propagation. In addition, changes in the nature and level of HPPD 
45 gene expression, and its effect on the vitamin E biosynthesis 
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performance on test plants, can be tested in greenhouse 
experiments . 

The invention furthermore relates to transgenic plants 
5 transformed with an expression cassette according to the 
invention, and to transgenic cells, tissues, organs and 
propagation material of such plants. Especially preferred in this 
context are transgenic crop plants, eg. barley, wheat, rye, 
maize, oats, soya, rice, cotton, sugar beet, canola, sunflowers, 
10 flax, hemp, potatoes, tobacco, tomatoes, oilseed rape, alfalfa, 
lettuce and the various tree, nut and grapevine species. 

Plants for the purposes of the invention are mono- and 
dicotyledonous plants or algae. 

15 

As already mentioned, HPPD is a suitable target for 
sulcotrione-type herbicides. To allow even more efficient HPPD 
inhibitors, it is necessary to provide suitable test systems with 
which inhibitor/enzyme binding studies can be carried out. To 
20 this end, for example, the complete barley HPPD cDNA sequence is 
cloned into an expression vector (pQE, Qiagen) and overexpressed 
in E. coli- 

The HPPD protein expressed with the aid of the expression 
25 cassette according to the invention is particularly suitable for 
finding HPPD-specif ic inhibitors. 

TO this end, the HPPD can be employed, for example, in an enzyme 
assay in which the HPPD activity is determined in the presence 
30 and absence of the active substance to be tested. A comparison of 
the two activity determinations allows qualitative and 
quantitative findings on the inhibitory behavior of the active 
substance to be tested to be obtained. 

35 The test system according to the invention allows a large number 
of chemical compounds to be screened rapidly and simply for 
herbicidal properties. The method allows the targeted and 
reproducible selection, amongst a large number of substances, of 
those with great potency in order to subject these substances 

40 subsequently to further in-depth tests with which the expert is 
familiar. 

The invention furthermore relates to herbicides which can be 
identified with the above-described test system. 

45 
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Overexpression in a plant of the gene sequence Seq ID NO: 1, which 
encodes an HPPD, results in an elevated resistance to HPPD 
inhibitors. The invention also relates to the transgenic plants 
thus generated. 

5 

The invention furthermore relates to: 

- A method of transforming a plant, which comprises introducing 
an expression cassette according to the invention into a plant 

10 cell, into callus tissue, into an entire plant or into plant 
protoplasts . 

- The use of a plant for generating plant HPPD. 

15 - The use of the expression cassette according to the invention 
for generating plants with elevated resistance to HPPD 
inhibitors by means of higher expression of a DNA sequence 
according to the invention. 

20 - The use of the expression cassette according to the invention 
for generating plants with an elevated vitamin E content by 
means of expressing, in plants, a DNA sequence according to the 
invention. 

25 - The use of the expression cassette according to the invention 
for generating a test system for identifying HPPD inhibitors. 

The invention is illustrated by the examples which follow, but 
not limited thereto: 

30 



35 



40 



45 
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General cloning methods 

The cloning steps carried out within the scope of the present 
5 invention, eg. restriction cleavages, agarose gel 

electrophoresis, purification of DNA fragments, transfer of 
nucleic acids onto nitrocellulose and nylon membranes, linking 
DNA fragments, transformation of E. coli cells, growing bacteria, 
multiplying phages and sequence analysis of recombinant DNA, were 
10 carried out as described by Sambrook et al. (1989) Cold Spring 
Harbor Laboratory Press; ISBN 0-87969-309-6). 

The bacterial strains used hereinbelow (E. coli, XL-I Blue) were 
obtained from Stratagene and, in the case of NP66, Pharmacia. The 

15 agrobacterial strain used for the transformation of plants 
(Agrobacterium tumefaciens, C58C1 with plasmid pGV2260 or 
pGV3850kann) was described by Deblaere et al. in (Nucl. Acids 
Res. 13 (1985) 4777). Alternatively, the agrobacterial strain 
LBA4404 (Clontech) or other suitable strains may also be 

20 employed. Vectors which can be used for cloning are the vectors 
PUC19 (Yanish-Perron, Gene 33 (1985), 103 - 119) pBluescript SK- 
( Stratagene ) , pGEM-T (Promega), pZerO (Invitrogen), pBinl9 (Bevan 
et al., Nucl. Acids Res. 12 (1984), 8711 - 8720) and pBinAR 
(Hofgen and Willmitzer, Plant Science 66 (1990) 221 - 230). 



25 



Sequence analysis of recombinant 



DNA 



Recombinant DNA molecules were sequenced using a laser 
fluorescence DNA sequencer by Licor (available from MWG Biotech, 
30 Ebersbach) following the method of Sanger (Sanger et al., Proc. 
Natl. Acad. Sci. USA 74 (1977), 5463 - 5467). 



Generation of plant expression cassettes 

35 Into plasmid pBinl9 (Bevan et al., Nucl. Acids Res. (1984) 12, 
8711) there was inserted a 35S CaMV promoter in the form of an 
EcoRI-Kpnl . fragment corresponding to nucleotides 6909 - 7437 of 
cauliflower mosaic virus (Franck et al. Cell 21 (1980) 285). The 
polyadenylation signal of gene 3 of the T-DNA of the Ti plasmid 

40 pTiACHS (Gielen et al., EMBO J. 3 (1984) 835), nucleotides 

11749 - 11939, was isolated as a PvuII-Hindlll fragment and, after 
addition of SphI linkers, cloned into the PvuII cleavage site 
between the SpHI-Hindlll cleavage site of the vector pBmAR-Hyg, 
This gave the plasmid pBinAR (Hofgen and Willmitzer, Plant 

45 Science 66 (1990) 221 - 230). 
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Use Examples 
Example 1 

5 Isolation of HPPD-specif ic cDNA sequences 

The composition of the mRNA population from primary leaves of 
nine-day-old barley plants which had been grown in an L/D 
photoperiod (16 hours light/8 hours dark) was compared with that 

10 of primary leaves of 11-day-old barley plants in which, after a 
raising period of nine days, senescence was subsequently induced 
by a two-day dark treatment (Humbeck and Krupinska, J. Photochem, 
Photobiol. 36 (1996), 321 - 326) with the aid of the DDRT-PCR 
method published by Liang and Pardee (Science (1992) 257, 967 - 

15 972). In each case 0.2 jxg of the total RNA was converted into cDNA 
using the enzyme "Superskript RT" (Gibco BRL, Eggenstein) . In 
addition to the RNA, the reaction batches (20 \xl) also contained 
20 \it\ dNTPs, 10 [iM DTT, IxRT buffer and in each case 1 jiM 
(dT)12VN primer. The anchor "primers" required for these 

20 reactions were synthesized on the basis of the data of Liang and 
Pardee : 

1 . 5 ' -TTTTTTTTTTTTAG-3 ' 
2 . 5 ' -TTTTT(rTTTTTTCA-3 ' 
253. 5 ' -TTTTTOTTTTTTAC-3 ' 
4 . 5 ' -TTTTTTTTTTTTGT-3 ' 

^-^After the cDNAsNwere synthesized, amplification of the relevant 
/sequences was effected in each case in ten batches, which differ 
by the use of theVandom "primers" given hereinbelow: 



1. 


5 ' -TACAACGAGG-3S' 


2. 


5' 


-GGAACCAATC-3 ' 


3. 


5 ' -AAACTCCGTC-3 \ 


4. 


5' 


-TGGTAAAGGG-3 ' 


5. 


5 ' -CTGCTTGATG-3 ' >i 


6. 


5' 


-GTTTTCGCAG-3 ' 


35 7. 


5 ' -GATCTCAGAC-3 ' \ 


8. 


5' 


-GATCTAACCG-3 ' 


9. 


5 ' -GATCATGGTC-3 ' 


\ 10. 


5' 


-GATCTAAGGC-3 ' 



In a volume of in each case 20 fxl, the PGR reaction batches 
contained IxPCR buffer, 2 \jM dNTPs, 2.5 jxCi (a 33p)-dATP, 1 \M 
(dT)i2VN-"primer", 1/10 vol. RT mix (Sambrook et al. Molecular 
Cloning - A Laboratory Manual, 1989), 1 U Taq DNA polymerase 
(Boehringer, Mannheim) and 1 \jM 10-mer random "primers". The 
PCR-reactions proceeded in a Uno block (Biometra) following the 
program below: 

45 




1. 94**C 2 min 

2. 94*'C 30 s 
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3. 40°C 2 min 

4. 72°C 30 s 

5. 72°C 5 min 

6. 4°C storage until further processing 

5 

Steps 2, 3 and 4 were carried out 40 times in succession. This 
gave approximately 100 cDNA bands per reaction and "primer" 
combination . 

10 In contrast to the protocol of Liang and Pardee, the amplified 
cDNA fragments were separated in non-denaturing polyacrylamide 
gels of the following composition: 6% (w/v) acrylamide (Long 
Ranger, AT Biochem) , 1.2 x TBE buffer, 0.005% (v/v) TEMED and 
0.005% (w/v) APS (Bauer et al, Nucl. Ac. Res. (1993) 21, 

15 4272 - 4280) . 

In each case 3.5 jil of each PGR batch were treated with 2 ^1 of 
loading buffer (dye II, Sambrook et al., 1989) and then loaded 
onto the gel. To determine the reproducibility of the cDNA band 

20 patterns (Fig. 5), in each case two independent RNA preparations 
(9 and 9', 11 and 11') were prepared from the barley primary 
leaves harvested on days 9 and 11 and used in parallel in the 
analysis below. What is shown is the result of two different 
primer combinations (A and B) ; by way of example, two differences 

25 in the band pattern between the sample of days 9 and 11 were 

emphasized by arrows. Only those bands which occurred equally in 
the two samples from senescing plants and which did not occur in 
the two comparison samples were taken into consideration when 
analyzing the gels at a later point in time. Electrophoresis was 

30 carried out over a period of 2.5 hours at 4 0 watt (0.8 w/cm^) in 
1 x TBE buffer. After separation of the cDNA fragments had 
occurred, the gel was transferred onto filter paper (Schleicher & 
Schiill, Dassel). After the gel had been dried at 50°C, an X-ray 
film was placed on top of it. cDNA bands which were only found in 

35 the case of samples 11 and 11' in the autoradiograph were excised 
from the dry gel using a surgical blade, and the DNA was eluted 
by boiling in 100 jxl 1 x TE buffer. The ethanol-precipitated DNA 
was resuspended in 10 \il of water for further tests. After 
reamplif ication with the "primers" previously used for this 
batch, the DNA was cloned and sequenced and also employed as a 
probe for Northern blot hybridizations. 



To test if the relevant cDNA fragment actually represents a 
senescence-specifically occurring transcript, hybridizations were 
carried out with RNA from leaves of various developmental stages: 
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A. 1. RNA from primary leaves from plants raised for 9 days in 
an L/D photoperiod 

A. 3. RNA from primary leaves from 10-day-old plants raised 
5 without a light phase on day 10 

A. 4 RNA from primary leaves from 11-day-old plants which 

lacked a light phase on days 10 and 11 

10 A. 5 RNA from primary leaves from 12-day-old plants which 

underwent a further light phase after 2 days in the dark 

The samples for RNA analysis were harvested in each case in the 
middle of the original night phase. 

15 

B. RNA from flag leaves which had been collected in the 
field at seven different points in time (Fig. 6). The 
leaves were fully grown on 29 May and showed less than 
10% of the original chlorophyll content on 21 June. The 

20 beginning of the senescence processes is shown in 

Figure 6 by an arrow (ie. 17 days after reaching the full 
length on 15 June). The beginning of senescence was 
defined as the day on which photosystem II efficacy 
dropped (Humbeck et al.. Plant Cell Environment (1996) 

25 19: 337 - 344) . 

To hybridize a filter with the above-described RNA samples, a 
specific probe for the rbcS gene, which encodes the small 
sub-unit of ribulose-1, 5-bisphosphate carboxylase, was also 

30 employed in addition to the HPPD probe, for comparison reasons. 
Figure 6 shows hybridization of the "Northern blots" A and B with 
cDNA HVSD36 and with a probe which is specific for the rbcS gene. 
Filter A carries RNA from barley primary leaves after a raising 
period of 9 days in an L/D photoperiod (9), after subsequent 

35 incubation in the dark for one and two days, respectively (10, 
11) and after subsequent return to light conditions for one day 
(12). Filter B contains RNA from flag leaves which had been 
harvested in the field in the period from 29.05. to 21.06.1992. 
The arrow indicates the beginning of senescence on 15.06. As can 

40 been seen from Figure 6, the amount of rJbcS-specif ic mRNA is high 
when the amount of HPPD-specif ic mRNA is relatively low. In 
primary leaves of nine-day-old plants, the HPPD-specif ic mRNA is 
not detectable prior to transfer into the dark and accumulates 
markedly during the dark phase. When the plants are returned to 

45 light conditions, the amount of this mRNA drops markedly. In the 
case of the flag leaves, small amounts of the HPPD-specif ic mRNA 
can already be detected in fully-grown, non-senescent leaves. As 
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early as 4 days prior to the actual beginning of senescence, 
expression levels are higher. The highest amount of this mRNA can 
be found in senescent leaves. A size comparison with known RNA 
species showed that the transcript detected with the cDNA probe 
5 HVSD36 (s: senescence; d: dark, fragment number 36 in the DDRT 
gel) has a length of approx. 1.6 kb. 

By means of DDRT PGR, three cDNA fragments were obtained 
independently of each other which showed this expression pattern 

10 and which, on the basis of sequence analysis, actually represent 
the same transcript. The longest fragment had a size of 230 bp. 
The 230 bp long PGR product was finally cloned into the Smal 
cleavage site of vector pUG18 using the "Sure Glone™ ligation 
kit" (Pharmacia, Freiburg) following the manufacturer's 

15 instructions. The recombinant plasmid was transformed into 

competent cells of E. coli strain DH5 a. Since, for methodology 
reasons, the fragment represents the 3' end of the relevant 
transcript, the sequence information was first insufficient to 
identify an unambiguous homology with a sequence in the 

20 databases. To isolate a longer corresponding cDNA, a lambda ZAPII 
library (Stratagene, Heidelberg) of RNA of senescent flag leaves 
was screened using the 230 bp long fragment as the probe. For this 
step, the probe was labeled with Dig-dUTP following the 
instructions of the "DNA Labeling and Detection Kit" (Boehringer, 

25 Mannheim) . The library was examined following the protocol of the 
"ZAP-cDNA Synthesis Kit" (Stratagene, Heidelberg). 

In the case of the probe described herein, 150,000 pfu were 
examined. Of these, 39 phage plaques gave a positive signal. Of 

30 these, further work was carried out on 12 phage populations. 

Following phage preparation, the fragments inserted were enriched 
via PGR and separated by electrophoresis. Southern blot 
hybridization with the HvSD36 probe allowed those phage 
populations which had the largest "inserts" with positive signal 

35 to be selected amongst the 12 phage populations thus treated. 
After replating, the phages were subjected to a further 
hybridization step. Single phage plaques were excised and, after 
elution, subjected to an in vivo excision using a helper phage 
and following the protocol from Stratagene (Exassist™ 

40 interference-Resistant Helper Phage with SOLR ™ strain) . The 
so-called "phagemids" obtained from this treatment contain the 
cDNA cloned in pBLueskript (SK-) . 

Following a subsequent plasmid preparation, the relevant "insert" 
45 was excised from the Bluescript plasmid using EcoRI. The cDNA 
clone obtained in the case of HvSD36 contains an "insert" with a 
length of approx. 800 bp. Complete sequencing of the cDNA was 
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carried out using the "SequiTherm Excel Long-Read 
DNA-Sequenzierungs-Kit" (Epicentre Technologies, Biozym 
Diagnostic, Oldendorf ) using IRD41-labeled universal "primers" 
which bind to sequence regions in the Bluescript vector, 
5 Detection of the DNA fragments was effected via the infrared 
laser of the automatic sequencer 4000L by Licor. After 
sequencing, an exactly 759 bp long sequence was present whose 
sides are flanked by an in each case 14 bp long adaptor sequence. 
These adaptor sequences were used for ligating the cDNA fragments 
10 with the arms of phage lambda ZAPII (Stratagene, Heidelberg) when 
generating the c-DNA library, 

Amongst the sequences in the databases, the protein sequence 
HvSD36, which has a total of over 180 amino acids, has a homology 
15 of 41% with the sequence of human HPPD which is the highest. 
Taking into consideration the length of the transcript detected 
in the "Northern blot" (approx. 1600 nt), it can be assumed that 
850-900 bp are still missing from the cDNA. 

20 To complete the cDNA, a further cDNA library was investigated. 
mRNA was isolated from the basal meristematic zone of 5-day-old 
barley seedlings with the aid of "Dynabeads" (Dynal, Hamburg) and 
transcribed into cDNA using the "Time Saver cDNA SyntheseKit" 
(Pharmacia, Freiburg). This was followed by ligation of 

25 EcoRI/NotI adaptors (Pharmacia, Freiburg) to the cDNA with 
subsequent ligation into the lambda ExCell vector (Pharmacia, 
Freiburg) . Finally, the recombinant phage DNA was packaged into 
phage proteins with the aid of "Gigapack II Gold Set" (Stratagene, 
Heidelberg). Using the 759 bp long probe HvSD36, 400,000 pfu were 

30 screened, and 5 phages were detected by the probe. Excision of the 
"phagemids" from the phage was effected in vivo with the aid of 
bacterial strain NP66 following the instructions of Pharmacia 
(Freiburg). The recombinant pExCell plasmids were isolated from 
the individual bacterial colonies and transferred into bacterial 

35 strain D115 a for propagation. 

The longest cDNA clone HvSD36 isolated in this manner has a 
length of 1565 bp and was sequenced completely (see sequence 
protocol) . 



40 



Excimple 2 



Characterization of the genomic sequence 



45 To identify a genomic clone which contains the gene sequence of 
HPPD, a lambda FIXII library of barley was obtained from 
Stratagene (Heidelberg). The library was prepared using DNA from 
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etiolated leaves of winter barley cv. Igri. The DNA was partially 
digested with Sau3AI. Prior to cloning into the Xhol cleavage 
site of the vector, the fragment ends and the phage arms were 
filled up with nucleotides. Screening of the library with 
5 200,000 pfu in the first round only gave one clone which 

hybridized with cDNA HvSD36. After subjecting this recombinant 
phage to restriction digestion with PstI and Sad, fragments 
5400, 3800 and 1800 bp in length were subsequently isolated which 
can be detected with the HvSD36 probe when carrying out a 
10 "Southern" blot hybridization. These sub-fragments exist in 
cloned form in the Bluescript vector. 

The library was screened following the protocol given for the 
HybondN membrane. Labeling of the probe for screening the library 
15 and for the "Southern" blot hybridizations was effected via 

"random priming" with 32p_ciATP using the Klenow enzyme (Sambrook 
et al., (1989) Molecular cloning. A laboratory manual. Cold 
Spring Harbor Laboratory, New York) • 

20 A genomic "Southern blot" was carried out with total DNA from 

barley (Carina) (Fig. 7). In each case 15 \ig of DNA were digested 
with BamHI (B) , EcoRI (E), Hindlll (H) or XBAI (X) and separated 
in a 0.75% agarose gel. After transfer to a Hybond N+ membrane 
(Amersham, Braunschweig), hybridization was effected with the 

25 incomplete, 759 bp long cDNA probe from HvSD36 following 
instructions of the membrane manufacturer. The following 
fragments were detected: 

BamHI: 6.0, 3.9 and 3.0 kbp 

30 EcoRI: >10 kbp 

Hindlll: 8.3, 2.6, 1.1 and 1.0 kbp 
Xbal: 9.0, 5.2 and 4.2 kbp 

The fragment lengths were estimated by comparison with a DNA size 
35 standard (Kb-Leiter, GibcoBRL, Eggenstein) . 



Exeunple 3 

Homology comparison of the HvSD36 protein sequence 

40 

A comparison of the HvSD36 protein sequence with protein 
sequences in the database revealed homologies to the following 
protein sequences known to date: 



45 
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HPPD_Hv 
HPPD_Ath 
HPPD__HUMAN 
HPPD_RAT 
HPPD_PIG 
HPPD_MOUSE 
HPPD_PSESP 
10 MELA_SHECO 
PEA3 MOUSE 



10 20 30 40 50 

MP PTPTTPAATG 

. /. MGHQNAA VSENQNHDDG 

MTKSSNHNCL LRPENKPGLW GPGAQAASLR PSPATLWSS PGHAEHPPAA 



15 



HPPD_Hv 
HPPD__Ath 
HPPD_HUMAN 
HPPD_RAT 
HPPD_PIG 
HPPD_MOUSE 
HPPD_PSESP 
MELA_SHECO 
20 PEA3 MOUSE 



60 70 
AAAAVTPEHA RPHRMVRFNP 
AASSPGFKLV GFSKFVRKNP 
M TTYSDKGAKP 
YWDKGPKP 
M TSYSDKGEKP 
M TTYNNKGPKP 



PAQTPGPQVS ASARGPGPVA 



80 

RSDRFHTLSF 
KSDKFKVKRF 
ERGRFLH — F 
ERGRFLH — F 
ERGRFLH — F 
ERGRFLH — F 
ADLYENP 
MASEQNP 
GGSGRMERIOl 



90 

HHVEFWCADA 
HHIEFWCGDA 
HSVTFWVGNA 
HSVTFWVGNA 
HSVTFWVGNA 
HSVTFWVGNA 
MGLMGFEFIE 
LGLLGIEFTE 
KGGYL DQ 



100 

ASAAGRFAFA 
TNVARRFSWG 
KQAASFYCSK 
KQAASFYCNK 
KQAASYYCSK 
KQAASFYCNK 
LASPTPNTIiE 
FATPDLDFMH 
RVPYTFCSKS 



25 



HPPD_Hv 

HPPD_Ath 

HPPD_HUMAN 

HPPD_RAT 

HPPD_PIG 

HPPD_MOUSE 

HPPD_PSESP 

MELA_SHECO 

PEA3 MOUSE 



30 



HPPD_Hv 
HPPD_Ath 
HPPD_HUMAN 
HPPD_RAT 
35 HPPD_PIG 

HPPD_MOUSE 
HPPD_PSESP 
MELA_SHECO 
PEA3 MOUSE 



110 

LGAPLAARSD 
LGMRFSAKSD 
MGFEPLAYRG 
MGFEPLAYKG 
IGFEPLAYKG 
MGFEPLAYRG 
PIFEIMGFTK 
KVFIDFGFSK 
PGNGSLGEAL 



120 

LSTGNSAHAS 
LSTGNMVHAS 
LETGSREWS 
LETGSREWS 
LETGSREWS 
LETGSREWS 
VATHRSKDV- 
LKKHKQKDI- 
MVPQGKLMDP 



160 170 

TASLPSFS ADAARRFSAD 

TTTASIPSFD HGSCRSFFSS 

WN KEMGDHL-VK 

WN KEMGDHL-VK 

WN. KEMGDHL-VK 

WN KEMGDHL-VK 

P HSVASYFAAE 

K QGFSAQFAKT 

SDEQFVPDFH SENLAFH 



130 

QLLRSGSLAF 
YLLTSGDLRF 
HVIKQGKIVF 
HVIKQGKIVF 
HWKQDKIVF 
HVIKRGKIVF 
HLYRQGAINL 
VYYKQNDINF 
GSLPPSDSED 

180 

HGIAVRSVAL 
HGLGVRAVAI 
HGDGVKDIAF 
HGDGVKDIAF 
HGDGVKDIAF 
HGDGVKDIAF 
HGPSVCGMAF 
HGPAISSMGW 
SPTTRIKKEP 



140 



150 



LFT — APYAN G-CDAA 

LFT — APYSP S-LSAGEIKP 

VLS — SA LNP 

VLC — SA LNP 

VFS — SA LNP 

VLC — SA LNP 

ILN — NE 

LLN — NE 

LFQDLSHFQE TWLAEAQVPD 



190 

RVADAAEAFR 
EVEDAESAFS 
EVEDCDYIVQ 
EVEDCEHIVQ 
EVEDCDYIVQ 
EVEDCDHIVQ 
RVKDSQKAYK 
RVEDANFAFE 
QSPRTDPALS 



200 

ASRRRGARPA 
ISVANGAIPS 
KARERGAKIM 
KARERGAKIV 
KARERGAIIV 
KARERGAKIV 
RALELGAQPI 
GAVARGAKPA 
CSRKPPLPYH 



40 

HPPD_Hv 
HPPD_Ath 
HPPD_HUMAN 
HPPD_RAT 
HPPD_PIG 
45 HPPD_MOUSE 
HPPD_PSESP 
MELA_SHECO 
PEA3 MOUSE 



210 220 230 240 250 

FAPV DLGRG FAFAEVELYG — DWLRFVS HP — DG — TO 

Sppi VLNEA VTIAEVKLYG — DWLRYVS YKAEDT — EK 

REP WVEQDKFGK VKFAVLQTYG DTTHTLVE KMN YI 

REP WVEEDKFGK VKFAVLQTYG DTTHTLVE KIN YT 

REEVC-CAAD VRGHHTPLDR AR QVWE — GT LVE KMT FC 

REP WVEQDKFGK VKFAVLQTYG — DTTHTLVE KIN YT 

HI ETGPME LNLPAIKGIG GAPLYLID RFGEGSSIYD 

AD EV — KD LPYPAIYGIG DSLIYFID TFGDDNNIYT 

HGEQCLYSRQ lAIKSPAPGA PGQSPLQPFS RAEQQQSLLR ASSSSQSHPG 
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HPPD_ 
HPPD_ 

hppd" 

5 hppd] 
HPPD_ 

hppd" 

MELA_ 
PEAS 



Hv 

'human 

^RAT 
PIG 
MOUSE 
'PSESP 
[SHECO 
MOUSE 



260 

VPFLPGFEGV 
SEFLPGFERV 
GQFLPGYEAP 
GRFLPGFEAP 
LDSRPQPSQT 
GRFLPGFEAP 
IDFV — FLEG 

SDF EA 

HGYLGEHSSV 



25 

270 

TNPDA 

EDASSF P 

AFMDPLLPKL 
TYKDTLLPKL 
LLHRLLLSKL 
TYKDTLLPKL 
VDRHPVGA — 
LDEPIITQ — 
FQQPVDMCHS 



280 

VDYGLTRFDH 
LDYGIRRLDH 
PKCSLEMIDH 
PSCNLEIIDH 
PKCGLEIIDH 
PRCNLEIIDH 

GLKIIDH 

-EKGFIEVDH 
FTSPQGGGRE 



290 

WGNVP — EL 
AVGNVP — EL 
IVGNQPDQEM 
IVGNQPDQEM 
IVGNQPDQEM 
IVGNQPDQEM 
LTHNVYRGRM 
LTNNVHKGTM 
PLPAPYQHQL 



300 

-APAAAYIAG 
-GPALTYVAG 

-VSASEW 

-ESASEW 

-ESASQW 

-QSASEW 

-A YWANF 

-E YWSNF 

SEPCPPYPQQ 



10 

HPPD_ 
HPPD_ 

hppd' 
hppd" 

HPPD_ 
15 HPPD_ 
HPPD_ 
MELA_ 
PEA3" 



Hv 
Ath 
HUMAN 
^RAT 
PIG 
MOUSE 
^PSESP 
[SHECO 
MOUSE 



310 

FT GFHEF 

FT GFHQF 

YLKNLQFHRF 
YLKNLQFHRF 
YMRNLQFHRF 
YLKNLQFHRF 
^EKLFNFREI 
YKDIFGFTEV 
NFKQ-EYHDP 



320 

AEFTAEDVGT 
AEFTADDVGT 
WSVDDTQVHT 
WSVDDTQVHT 
WSVDDTQIHT 
WSVDDTQVHT 

RYF DIKG 

RYF DIKG 

LYEQAGQPAS 



330 

TESGLNSWL 
AESGLNSAVL 
EYSSLRSIW 
EYSSLRSIW 
EYSALRSWM 
EYSSLRSIW 
EYTGLTSKAM 
SQTALISYAL 
SQGGVSGHRY 



340 

ANNSEGVLLP 
ASNDEMVLLP 
ANYEESIKMP 
ANYEESIKMP 
ANYEESIKMP 
TNYEESIKMP 
TAPDGMIRIP 
RSPDGSFCIP 
PGAGWIKQE 



350 

LNEPVHGTKR 
INEPVHGTKR 
INEPAPG-KK 
INEPAPG-RK 
INEPAPG-KK 
INEPAPG-RK 
LNE — ESSKG 
INE — GKGDD 
RTDFAYDSDV 



20 HPPD_ 

hppd" 

HPPD_ 

hppd' 

HPPD_ 
HPPD_ 
25 hppd] 

mela] 

PEA3" 



Hv 
Ath 

"human 

^RAT 

PIG 

MOUSE 

"pSESP 

^SHECO 

"mouse 



360 

RSQIQTFLEH 
KSQIQTYLEH 
KSQIQEYVDY 
KSQIQEYVDY 
KSQIQEYVDY 
KSQIQEYVDY 
AGQIEEFLMQ 
RNQIDEYLKE 
PGCASMYLHP 



370 

HGGPGVQH-I 
NEGAGLQH-L 
NGGAGVQH-I 
NGGAGVQH-I 
NGGAGVQH-I 
NGGAGVQH-I 
FNGEGIQH-V 
YDGPGVQH-L 
EGFSGPSPGD 



380 

AVASSDVLRT 
ALMSEDIFRT 
ALKTEDIITA 
ALRTEDIITT 
ALKTEDIITA 
ALKTEDIITA 
AFLSDDLIKT 
AFRSRDIVAS 
GVMGYGYEKS 



390 

LRKMRARSAM 
LREMRKRSSI 

IRHLRER 

IRHLRER 

IRSLRER 

IRHLRER 

WDHLKSI 

LDAMEGS 

LRPFPDDVCI 



400 

GGFDFLPPPL 
GGFDFMPSPP 
-GLEFLSVP- 
-GMEFLAVP- 
-GVEFLAVP- 
-GTEFLAAP- 
-GMRFMTAPP 
-SIQTLDIIP 
VPKKFEGDIK 



HPPD_HV 
30 HPPD_Ath 
HPPD_HUMAN 
HPPD_RAT 
HPPD_PIG 
HPPD_MOUSE 
HPPD_PSESP 
MELA SHECO 



35 



PEA3 MOUSE 



410 

PKYYEGVRRL 
PTYYQNLKKR 
STYYKQLREK 
SSYYRLLREN 
FTYYKQLQEK 
SSYYKLLREN 
DTYYEMLEGR 
E-YYDTIFEK 
QEGIGAFREG 



440 



450 



420 430 

AGD — VLSEA QIKECQELGV LVDRDDQG VLL 

VGD — VLSDD QIKECEELGI LVDRDDQG — — TLL 

LKTAKIKVKE NIDALEELKI LVDYDEKG YLL 

LKTSKIQVKE NMDVLEELKI LVDYDEKG YLL 

LKSAKIRVKE SIDVLEELKI LVDYDEKG YLL 

LKSAKIQVKE SMDVLEELHI LVDYDEKG — YLL 

HGE PVGELQARGI LLDGSSESGD KRLLL 

LPQ VTE DRDRIKHHQI LVDGDEDG YLL 

PPYQR RGALQLWQF LVALLDDPTN AHFIAWTGRG 



HPPD_Hv 

HPPD_Ath 

40 HPPD_HUMAN 

HPPD_RAT 

HPPD_PIG 

HPPD_MOUSE 

HPPD_PSESP 

MELA_SHECO 

PEA3 MOUSE 
45 — 



460 

QIFTKPVGDR 
QIFTKPLGDR 
QIFTKPVQDR 
QIFTKPMQDR 
QIFTKPMQDR 
QIFTKPMQDR 
QIFSETLMGP 
QIFTKNLFGP 
MEFKLIEPEE 



470 

PTLFLEMIQR 
PTIFIEIIQR 
PTLFLEVIQR 
PTLFLEVIQR 
PTVFLEVIQR 
PTLFLEVIQR 
— VFFEFIQR 
— IFIEIIQR 
VARLWGIQKN 



480 490 500 

IGCMEKDERG EE YQKG GCGGFGKGNF 

VGCMMKDEEG KA YQSG GCGGFGKGNF 

HNHQ GFGAGNF 

HNHQ GFGAGNF 

HUHQ GFGAGNF 

HNHQ GFGAGNF 

KGDD- GFGEGNF 

KNNL- GFGEGNF 

RPAMNYDKLS RSLRYYYEKG IMQKVAGERY 



HPPD Hv 



510 520 530 540 

-SE LFK-SIE-DY — EKS LEA KQSAAV-QGS 



550 



0050/48141 



HPPD_Ath 

HPPD_HUMAN 

HPPD_RAT 

HPPD_PIG 

HPPD_MOUSE 

HPPD_PSESP 

MELA_SHECO 

PEA3 MOUSE 



26 

SE LFK-SIE-EY — EKT — LEA KQLVG 

NS LFK-AFEEEQ — NLRGNLTN METNGWPGM 

^ NS LFK-AFEEEQ — ALRG 

NS LFK-AFEEEQ — ELRGNLTD TDPNGVPFRL 

NS LFK-AFEEEQ — ALRGNLTD LEPNGVRSGM 

KA LFE-SIERDQ — VRRGVLST -D 

KA LFE-SIERDQ — VRRGVL 

VYKFVCEPEA LFSLAFPDNQ RPALKAEFDR PVSEEDTVPL SHLDESPAYL 



560 



570 



HPPD_ 

10 hppd' 

HPPD^ 
HPPD, 

hppd] 

HPPD_ 

hppd] 
mela] 

PEA3'_ 
Key: 



20 



_Hv 
Ath 

[human 
]rat 

]PIG 

[mouse 

[PSESP 

[sheco 
"mouse 



25 



30 



35 



PELTGPAPPF GHRGGYSY 

HPPD_Hv: Hordeum vulgare 4-hydroxyphenylpyruvate 

dioxygenase ( HvSD3 6 ) 

HPPD_Ath: Arabidopsis thaliana 

4-hydroxyphenylpyruvate dioxygenase 

HPPD_HUMAN: H. sapiens 4-hydroxyphenylpyruvate 

dioxygenase 

HPPD_PIG: pig 4-hydroxyphenylpyruvate dioxygenase 

HPPD__RAT: rat F alloantigen 

HPPD_MOUSE: mouse 4-hydroxyphenylpyruvate 

dioxygenase 

MELA_SHECO: S, colwelliana melA protein 
HPPD_PSESP: Pseudomonas sp. (strain P. J, 874) 
4 -hydroxypheny Ipyruvate dioxygenase 

PEA3_MOUSE: Mus musculus (mouse) PEA3 polypeptide 

The greatest homology was with the Arabidopsis sequence, 
viz, 58% over the entire sequence (62% over 412 amino 
acids), followed by HPPD_RAT with 35% (over 365 amino 
acids), HPPD_HUMAN 34% (over 365 amino acids), HPPp_MOUSE 
34% (over 371 amino acids). 



Example 4 

Raising barley {Hordeum vulgare) 

40 

Barley seedlings (Hordeum vulgare L. cv. Carina, Ackermann 
Saatzucht, Irbach, Germany) were raised over a period of 15 days 
under controlled conditions in a controlled-environment cabinet 
in so-called Mitscherlich pots in soil containing 4 g of Osmocote 
45 5M (Urania, Heunburg, Germany) per liter. To ensure uniform 

growth, the seeds were germinated on moist filter paper in the 
dark for 2 days at 4^0 and 1 day at 21^0, and only those seedlings 
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were planted which showed the same longitudinal growth of the 
primary root. After these seedlings had been transferred onto 
soil, they were covered with screened soil to a depth of 1.5 cm. 
Thereafter, the plants were incubated for 9 days at 16 hours light 
5 (120 Mm-m-2-s~i) and 8 hours darkness in conjunction with a 

temperature shift (21^0 during the day, 16^0 during the night). 
After 9 days, the plants were kept for 2 days (days 10 and 11) in 
the dark at the abovementioned temperature in order to induce 
senescence. 



Example 5 
Raising tobacco 

The tobacco plants were raised following the known method. The 
15 tobacco cultivar used is Nicotlana tabacum cv. Xanthi. 

Example 6 

Transformation of tobacco 

20 The expression cassette according to the invention comprising the 
HPPD gene with Sequence 1 was cloned into vector pBinAR-Hyg 
(Fig. 4). Tobacco plants as described in Example 5 were 
subsequently transformed with this vector following the known 
method. 

25 

Example 7 

Increasing the tocopherol biosynthesis in tobacco 

The HPPD cDNA was provided with a CaMV 35S promoter and 
30 overexpressed in tobacco using the 35S promoter. In parallel, the 
seed-specific phaseolin gene promoter was used to increase the 
tocopherol content specifically in the tobacco seed. Tobacco 
plants which had been transformed with the relevant constructs 
were raised in the greenhouse. The a-tocopherol content of the 
35 total plant and of the seeds of the plant was subsequently 
determined. In all cases, the a-tocopherol concentration was 
increased in comparison with the untransf ormed plant. 



10 



40 



45 
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GENERAL INFORMATION 



(iii) 

(iv) 



APPLICANT 

(A) NAME: BASF AG 

STREET: Carl Bosch 
TOWN : Ludwigshaf en 
FEDERAL COUNTRY: Germany 
POSTCODE: 67056 
TELEPHONE : 0621-60-52698 
TITLE OF APPLICATION: HPPD sequence 
parley 

OF SEQUENCES: 2 
dPUTER-READABLE FORM: 



(B) 
(C) 
(D) 
(F) 
(G) 



(A) 
(B) 
(C) 
(D) 

INFORMATION ON 
(i) 




from 



(ii) 
(iii) 
(iii) 
(vi) 



(vii) 



(xi) 

CGCACACC ATG CCG CCC 
Met Pro Pro 
1 



:CORDING MEDIUM: floppy disk 
>MPUTER: IBM PC compatible 

SYSTEM: PC-DOS /MS-DOS 
WARE: Patentin release #1*0, Version 
#1.2^ EPA) 
SEQ "I^NO : 1 : 
SEQUENCE CHkE^CTERISTICS: 

(A) LENGTH: \565 base pairs 

(B) TYPE: nuclseic acid 

( C ) STRANDEDNESfi : double 

( D ) TOPOLOGY : 1 rine ar 
MOLECULE TYPE: cm 
HYPOTHETIC: NO 
ANTISENSE: NO 
ORIGINAL SOURCE: 
(A) ORGANISM: hppd fr^ barley 
(D) DEVELOPMENTAL STAGB^: senescence 
IMMEDIATE SOURCE: 

(A) LIBRARY: lambda FlXliSy^ library of barley 

(B) CLONE: pHvSD36*seq 
FEATURES : 

(A) NAME /KEY: CDS 

(B) POSITION: 9.. 13 13 
PUBLICATION DETAILS: 
(A) AUTHORS: Krupinska, Karin 

TITLE: Overexpression of 
JOURNAL: overexpression of 
DATE: 1998 

RELEVANT RESIDUES IN SEQ ID N^: 1 FROM 1 
TO 1565 

SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
ACC CCC ACC ACC CCC GCG GCT ACC GGC dCC GCC 
Thr Pro Thr Thr Pro Ala Ala Thr Gly Ala Ala 
5 10 



(B) 
(C) 
(G) 
(K) 



50 
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GCC GCG GTG ACG CCG GAG CAC GCG CGA CCG CAC CGA ATG GTC CGC TTC 98 
Ala Ala Val Thr Pro Glu His Ala Arg Pro His Arg Met Val Arg Phe 
15 20 25 30 

AAC CCG CGC AGC GAC CGC TTC CAC ACG CTC TCC TTC CAC CAC GTC GAG 146 
Asn Pro Arg Ser Asp Arg Phe His Thr Leu Ser Phe His His Val Glu 

35 40 45 

TTC TGG TGC GCG GAC GCC GCC TCC GCC GCC GGC CGC TTC GCG TTC GCG 194 
Phe Trp Cys Ala Asp Ala Ala Ser Ala Ala Gly Arg Phe Ala Phe Ala 
50 55 60 

CTC GGC GCG CCG CTC GCC GCC AGG TCC GAC CTC TCC ACG GGG AAC TCC 242 
Leu Gly Ala Pro Leu Ala Ala Arg Ser Asp Leu Ser Thr Gly Asn Ser 
65 70 75 

GCG CAC GCC TCC CAG CTG CTC CGC TCG GGC TCC CTC GCC TTC CTC TTC 290 
Ala His Ala Ser Gin Leu Leu Arg Ser Gly Ser Leu Ala Phe Leu Phe 
80 85 90 

ACC GCG CCC TAC GCC AAC GGC TGC GAC GCC GCC ACC GCC TCC CTG CCC 338 
Thr Ala Pro Tyr Ala Asn Gly Cys Asp Ala Ala Thr Ala Ser Leu Pro 
95 100 105 110 

TCC TTC TCC GCC GAC GCC GCG CGC CGG TTC TCC GCC GAC CAC GGG ATC 386 
Ser Phe Ser Ala Asp Ala Ala Arg Arg Phe Ser Ala Asp His Gly lie 

115 120 125 

GCG GTG CGC TCC GTA GCG CTG CGC GTC GCA GAC GCC GCC GAG GCC TTC 434 
Ala Val Arg Ser Val Ala Leu Arg Val Ala Asp Ala Ala Glu Ala Phe 
130 135 140 

CGC GCC AGT CGT CGA CGG GGC GCG CGC CCG GCC TTC GCC CCC GTG GAC 482 
Arg Ala Ser Arg Arg Arg Gly Ala Arg Pro Ala Phe Ala Pro Val Asp 
145 150 155 

CTC GGC CGC GGC TTC GCG TTC GCG GAG GTC GAG CTC TAC GGC GAC GTC 530 
Leu Gly Arg Gly Phe Ala Phe Ala Glu Val Glu Leu Tyr Gly Asp Val 
160 165 170 

GTG CTC CGC TTC GTC AGC CAC CCG GAC GGC ACG GAC GTG CCC TTC TTG 578 
Val Leu Arg Phe Val Ser His Pro Asp Gly Thr Asp Val Pro Phe Leu 
175 180 185 190 

CCG GGG TTC GAG GGC GTA ACC AAC CCG GAC GCC GTG GAC TAC GGC CTG 626 
Pro Gly Phe Glu Gly Val Thr Asn Pro Asp Ala Val Asp Tyr Gly Leu 

195 200 205 

ACG CGG TTC GAC CAC GTC GTC GGC AAC GTC CCG GAG CTT GCC CCC GCC 674 
Thr Arg Phe Asp His Val Val Gly Asn Val Pro Glu Leu Ala Pro Ala 
210 215 220 
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GCA GCC TAG ATC GCC GGG TTC ACQ GGG TTC CAC GAG TTC GCC GAG TTC 722 

Ala Ala Tyr lie Ala Gly Phe Thr Gly Phe His Glu Phe Ala Glu Phe 
225 230 235 

ACG GCG GAG GAC GTG GGC ACG ACC GAG AGC GGG CTC AAC TCG GTG GTG 770 
Thr Ala Glu Asp Val Gly Thr Thr Glu Ser Gly Leu Asn Ser Val Val 
240 245 250 

CTC GCC AAC AAC TCG GAG GGC GTG CTG CTG CCG CTC AAC GAG CCG GTG 818 
Leu Ala Asn Asn Ser Glu Gly Val Leu Leu Pro Leu Asn Glu Pro Val 
255 260 265 270 

CAC GGC ACC AAG CGC CGG AGC CAG ATA CAG ACG TTC CTG GAA CAC CAC 866 
His Gly Thr Lys Arg Arg Ser Gin lie Gin Thr Phe Leu Glu His His 

275 280 285 

GGC GGC CCG GGC GTG CAG CAC ATC GCG GTG GCC AGC AGT GAC GTG CTC 914 
Gly Gly Pro Gly Val Gin His lie Ala Val Ala Ser Ser Asp Val Leu 
290 295 300 

AGG ACG CTC AGG AAG ATG CGT GCG CGC TCC GCC ATG GGC GGC TTC GAC 962 
Arg Thr Leu Arg Lys Met Arg Ala Arg Ser Ala Met Gly Gly Phe Asp 
305 310 315 

TTC CTG CCA CCC CCG CTG CCG AAG TAC TAC GAA GGC GTG CGA CGC CTT 1010 
Phe Leu Pro Pro Pro Leu Pro Lys Tyr Tyr Glu Gly Val Arg Arg Leu 
320 325 330 

GCC GGG GAT GTC CTC TCG GAG GCG CAG ATC AAG GAA TGC CAG GAG CTG 1058 
Ala Gly Asp Val Leu Ser Glu Ala Gin He Lys Glu Cys Gin Glu Leu 
335 340 345 350 

GGT GTG CTC GTC GAT AGG GAC GAC CAA GGG GTG TTG CTC CAA ATC TTC 1106 
Gly Val Leu val Asp Arg Asp Asp Gin Gly Val Leu Leu Gin He Phe 

355 360 365 

ACC AAG CCA GTA GGG GAC AGG CCG ACC TTG TTC CTG GAG ATG ATC CAG 1154 
Thr Lys Pro Val Gly Asp Arg Pro Thr Leu Phe Leu Glu Met He Gin 
370 375 380 

AGG ATC GGG TGC ATG GAG AAG GAC GAG AGA GGG GAA GAG TAC CAG AAG 1202 
Arg He Gly Cys Met Glu Lys Asp Glu Arg Gly Glu Glu Tyr Gin Lys 
385 390 395 

GGT GGC TGC GGC GGG TTC GGC AAA GGC AAC TTC TCC GAG CTG TTC AAG 1250 
Gly Gly Cys Gly Gly Phe Gly Lys Gly Asn Phe Ser Glu Leu Phe Lys 
400 405 410 

TCC ATT GAA. GAT TAC GAG AAG TCC CTT GAA GCC AAG CAA TCT GCT GCA 1298 
Ser He Glu Asp Tyr Glu Lys Ser Leu Glu Ala Lys Gin Ser Ala Ala 
415 420 425 430 
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GTT CAG GGA TCA TAGGATAGAA GCTGGTCCTT GTATCATGGT CTCATGGAGC 1350 
Val Gin Gly Ser 

435 

AAAAGAAAAC AATGTTGTTT GTAATATGCG TCGCACAATT ATATCAATGT TATAATTGGT 1410 
GAAGCTGJU^G ACAGATGTAT CCTATGTATG ATGGGTGTAA TGGATGGTAG AGGGGCTCAC 1470 
ACATGAAGAA AATGTAGCGT TGACATTGTT GTACAATCTT GCTTGCAAGT AAAATAAAGA 1530 
ACAGATTTTG AGTTCTGCAA AAAAAAT^AAA AAAAA 1565 
(2) INFORMATION ON SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 amino acids 

(B) TYPE: cunino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Pro Pro Thr Pro Thr Thr Pro Ala Ala Thr Gly Ala Ala Ala Ala 
1 5 10 15 

Val Thr Pro Glu His Ala Arg Pro His Arg Met Val Arg Phe Asn Pro 
20 25 30 

Arg Ser Asp Arg Phe His Thr Leu Ser Phe His His Val Glu Phe Trp 
35 40 45 

Cys Ala Asp Ala Ala Ser Ala Ala Gly Arg Phe Ala Phe Ala Leu Gly 
50 55 60 

Ala Pro Leu Ala Ala Arg Ser Asp Leu Ser Thr Gly Asn Ser Ala His 
65 70 75 80 

Ala Ser Gin Leu Leu Arg Ser Gly Ser Leu Ala Phe Leu Phe Thr Ala 

85 90 95 

Pro Tyr Ala Asn Gly Cys Asp Ala Ala Thr Ala Ser Leu Pro Ser Phe 
100 105 110 

Ser Ala Asp Ala Ala Arg Arg Phe Ser Ala Asp His Gly lie Ala Val 
115 120 125 

Arg Ser Val Ala Leu Arg Val Ala Asp Ala Ala Glu Ala Phe Arg Ala 
130 135 140 

Ser Arg Arg Arg Gly Ala Arg Pro Ala Phe Ala Pro Val Asp Leu Gly 
145 150 155 160 
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7^ Gly Phe Ala Phe Ala Glu Val Glu Leu Tyr Gly Asp Val Val Leu 
\ 165 170 175 

Arg PhV Val Ser His Pro Asp Gly Thr Asp Val Pro Phe Leu Pro Gly 
\ 180 185 190 

Phe Glu Gl\ Val Thr Asn Pro Asp Ala Val Asp Tyr Gly Leu Thr Arg 
195\ 200 205 

Phe Asp His VaI\Val Gly Asn Val Pro Glu Leu Ala Pro Ala Ala Ala 
210 \ 215 220 

Tyr lie Ala Gly Phe\Thr Gly Phe His Glu Phe Ala Glu Phe Thr Ala 
225 2^0 235 240 

Glu Asp Val Gly Thr ThrNciu Ser Gly Leu Asn Ser Val Val Leu Ala 

245 \ 250 255 

Asn Asn Ser Glu Gly Val LeuxLeu Pro Leu Asn Glu Pro Val His Gly 
260 \ 265 270 

Thr Lys Arg Arg Ser Gin lie GlnSThr Phe Leu Glu His His Gly Gly 
275 280 \ 285 

Pro Gly Val Gin His lie Ala Val Ala\Ser Ser Asp Val Leu Arg Thr 
290 295 \ 300 

Leu Arg Lys Met Arg Ala Arg Ser Ala Met\Gly Gly Phe Asp Phe Leu 
305 310 \l5 320 

Pro Pro Pro Leu Pro Lys Tyr Tyr Glu Gly Veil Arg Arg Leu Ala Gly 

325 330 \ 335 

Asp Val Leu Ser Glu Ala Gin lie Lys Glu Cys Gin Glu Leu Gly Val 
340 345 \ 350 

Leu Val Asp Arg Asp Asp Gin Gly Val Leu Leu Gin \le Phe Thr Lys 
355 360 3V5 

Pro Val Gly Asp Arg Pro Thr Leu Phe Leu Glu Met Ilev Gin Arg lie 
370 375 380 \ 

Gly Cys Met Glu Lys Asp Glu Arg Gly Glu Glu Tyr Gin LJrs Gly Gly 
385 390 395 • \ 4O0 

Cys Gly Gly Phe Gly Lys Gly Asn Phe Ser Glu Leu Phe Lys^er lie 

405 410 Xl5 

Glu Asp Tyr Glu Lys Ser Leu Glu Ala Lys Gin Ser Ala Ala V^ Gin 
420 425 430 \ 



Gly Ser 



